智能论文笔记

OpenStreetMap-based Autonomous Navigation With LiDAR Naive-Valley-Path Obstacle Avoidance

Miguel Angel Munoz-Banon , Edison Velasco-Sanchez , Francisco A. Candelas , Fernando Torres

分类：机器人

2021-08-20

在本文中，我们为非结构化的户外环境提供了一个完整的自主导航管道。这项工作的主要贡献位于路径规划模块上，我们分为两个主要类别：全局路径规划（GPP）和本地路径规划（LPP）。对于环境表示，而不是复杂和重型网格图，GPP层使用直接从OpenStreetMaps（OSM）获得的道路网络信息。在LPP层中，我们使用新颖的天真谷路（NVP）方法来生成局部路径，避免实时障碍物。这种方法使用LIDAR传感器使用本地环境的天真表示。此外，它使用了一个天真的优化，用于利用成本图中的“谷”区域的概念。我们在研究平台蓝色实验上实验展示了该系统的稳健性，在阿利坎特大学科学园区自主驾驶超过20公里，在12.33公顷地区。

translated by 谷歌翻译

Understanding Postpartum Parents' Experiences via Two Digital Platforms

Xuewen Yao , Miriam Mikhelson , Megan Micheletti , Eunsol Choi , S Craig Watkins , Edison Thomaz , Kaya De Barbaro

分类：自然语言处理

2022-12-22

Digital platforms, including online forums and helplines, have emerged as avenues of support for caregivers suffering from postpartum mental health distress. Understanding support seekers' experiences as shared on these platforms could provide crucial insight into caregivers' needs during this vulnerable time. In the current work, we provide a descriptive analysis of the concerns, psychological states, and motivations shared by healthy and distressed postpartum support seekers on two digital platforms, a one-on-one digital helpline and a publicly available online forum. Using a combination of human annotations, dictionary models and unsupervised techniques, we find stark differences between the experiences of distressed and healthy mothers. Distressed mothers described interpersonal problems and a lack of support, with 8.60% - 14.56% reporting severe symptoms including suicidal ideation. In contrast, the majority of healthy mothers described childcare issues, such as questions about breastfeeding or sleeping, and reported no severe mental health concerns. Across the two digital platforms, we found that distressed mothers shared similar content. However, the patterns of speech and affect shared by distressed mothers differed between the helpline vs. the online forum, suggesting the design of these platforms may shape meaningful measures of their support-seeking experiences. Our results provide new insight into the experiences of caregivers suffering from postpartum mental health distress. We conclude by discussing methodological considerations for understanding content shared by support seekers and design considerations for the next generation of support tools for postpartum parents.

translated by 谷歌翻译

Detection and depth estimation for domestic waste in outdoor environments by sensors fusion

Ignacio de L. Páez-Ubieta , Edison Velasco-Sánchez , Santiago T. Puente , Francisco A. Candelas

分类：机器人

2022-11-08

In this work, we estimate the depth in which domestic waste are located in space from a mobile robot in outdoor scenarios. As we are doing this calculus on a broad range of space (0.3 - 6.0 m), we use RGB-D camera and LiDAR fusion. With this aim and range, we compare several methods such as average, nearest, median and center point, applied to those which are inside a reduced or non-reduced Bounding Box (BB). These BB are obtained from segmentation and detection methods which are representative of these techniques like Yolact, SOLO, You Only Look Once (YOLO)v5, YOLOv6 and YOLOv7. Results shown that, applying a detection method with the average technique and a reduction of BB of 40%, returns the same output as segmenting the object and applying the average method. Indeed, the detection method is faster and lighter in comparison with the segmentation one. The committed median error in the conducted experiments was 0.0298 ${\pm}$ 0.0544 m.

translated by 谷歌翻译

Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding

Erica K. Shimomoto , Edison Marrese-Taylor , Hiroya Takamura , Ichiro Kobayashi , Hideki Nakayama , Yusuke Miyao

分类：计算机视觉 | 自然语言处理

2022-09-26

本文探讨了时间视频接地（TVG）的任务，在该任务中，给定未修剪的视频和查询句子，目标是在提供的自然语言查询描述的视频中识别和确定动作实例的时间界。最近的作品通过使用大型预训练的语言模型（PLM）直接编码查询来解决此任务。但是，很难隔离改进的语言表示的影响，因为这些作品还提出了视觉输入的改进。此外，这些PLM大大增加了训练TVG模型的计算成本。因此，本文研究了PLM在TVG任务中的影响，并根据适配器评估了NLP参数效率培训替代方案的适用性。我们将流行的PLM与选择现有方法和测试不同的适配器相结合，以减少其他参数的影响。我们在三个具有挑战性的数据集上的结果表明，当TVG模型对该任务进行微调时，可以从PLM中受益匪浅，并且适配器是完全微调的有效替代方法，即使它们并不适合我们的任务。具体而言，适配器有助于节省计算成本，从而使PLM集成在较大的TVG模型中，并提供与最先进模型相当的结果。最后，通过对TVG中不同类型的适配器进行基准测试，我们的结果阐明了哪种适配器最适合每个研究的情况。

translated by 谷歌翻译

Orthogonal Gated Recurrent Unit with Neumann-Cayley Transformation

Edison Mucllari , Vasily Zadorozhnyy , Cole Pospisil , Duc Nguyen , Qiang Ye

分类：机器学习

2022-08-12

近年来，使用正交矩阵已被证明是通过训练，稳定性和收敛尤其是控制梯度来改善复发性神经网络（RNN）的一种有希望的方法。通过使用各种门和记忆单元，封闭的复发单元（GRU）和长期短期记忆（LSTM）体系结构解决了消失的梯度问题，但它们仍然容易出现爆炸梯度问题。在这项工作中，我们分析了GRU中的梯度，并提出了正交矩阵的使用，以防止梯度问题爆炸并增强长期记忆。我们研究了在哪里使用正交矩阵，并提出了基于Neumann系列的缩放尺度的Cayley转换，以训练GRU中的正交矩阵，我们称之为Neumann-cayley Orthoconal orthoconal Gru或简单的NC-GRU。我们介绍了有关几个合成和现实世界任务的模型的详细实验，这些实验表明NC-GRU明显优于GRU以及其他几个RNN。

translated by 谷歌翻译

An Empirical Deep Dive into Deep Learning's Driving Dynamics

Charles Edison Tripp , Jordan Perr-Sauer , Lucas Hayne , Monte Lunacek

分类：机器学习

2022-07-25

我们提出了一个经验数据集，该数据集对完全连接的网络上的深度学习现象进行了调查，其中包括众多网络拓扑的训练和测试性能，跨越了多个学习任务，深度，免费参数，学习率，批次大小和正则化罚款。该数据集探针1.78千名高参数设置平均每次重复20次，总共350万次培训和20个绩效指标，每次观察到的131亿个培训时期。累积了671 GB数据集，使用了5,448个CPU核心年，17.8 GPU年和111.2个节点年。此外，我们提供了初步分析，揭示了跨学习任务和拓扑的模式。我们旨在激发经验研究现代机器学习技术的工作，以此作为催化剂的催化剂，以进步能源密集型和启发式实践所需的领域。

translated by 谷歌翻译

LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach

Cristian Rodriguez-Opazo , Edison Marrese-Taylor , Basura Fernando , Hiroya Takamura , Qi Wu

分类：计算机视觉

2021-12-19

我们提出了Locommer，一种基于变压器的视频接地模型，其在恒定的存储空间中运行，无论视频长度如何，即帧数。 Locommer专为任务而设计，在那里需要处理整个长视频，并在其核心贴上两个主要贡献。首先，我们的模型包含一种新的采样技术，将输入要素序列分成固定数量的部分，并使用随机方法选择每个部分的单个特征，这允许我们获得代表视频内容的特征样本集在手中的任务，同时保持内存占用空间。其次，我们提出了一种模块化设计，将功能分开，使我们能够通过监督自我关注头来学习归纳偏差，同时还有效利用预先接受训练的文本和视频编码器。我们在相关的基准数据集中测试我们的建议，以进行视频接地，表明该表现形式不仅可以实现优异的结果，包括在YouCookii上的最先进的性能，也可以比竞争对手更有效，并且它一直有效在平均工作的情况下，最新工作的表现，均值较大，最终导致Chardes-STA的新的最先进的性能。

translated by 谷歌翻译